67 research outputs found

    PhyloFunDB: A Pipeline to Create and Update Functional Gene Taxonomic Databases

    The increase in sequencing capacity has amplified the number of taxonomically unclassified sequences in most databases. The classification of such sequences requires phylogenetic tree construction and comparison to currently classified sequences, a process that demands the processing of large amounts of data and the use of several different software tools. Here, we present PhyloFunDB, a pipeline for extracting, processing, and inferring phylogenetic trees from specific functional genes. The goal of our work is to decrease processing time and facilitate the grouping of sequences that can be used for improved taxonomic classification of functional gene datasets.

    CGtag: Complete genomics toolkit and annotation in a cloud-based Galaxy

    Background: Complete Genomics provides an open-source suite of command-line tools for the analysis of their CG-formatted mapped sequencing files. Determining, for example, the functional impact of detected variants requires annotation with various databases, and using these often requires command-line and/or programming experience, which limits their use by the average research scientist. We have therefore implemented this CG toolkit, together with a number of annotation, visualisation and file manipulation tools, in Galaxy as CGtag (Complete Genomics Toolkit and Annotation in a Cloud-based Galaxy). Findings: In order to provide research scientists with web-based, simple and accurate analytical and visualisation applications for the selection of candidate mutations from Complete Genomics data, we have implemented the open-source Complete Genomics tool set, CGATools, in Galaxy. In addition, we implemented some of the most popular command-line annotation and visualisation tools to allow research scientists to select candidate pathological mutations (SNVs and indels). Furthermore, we have developed a cloud-based public Galaxy instance to host the CGtag toolkit and other associated modules. Conclusions: CGtag provides a user-friendly interface to all research scientists wishing to select candidate variants from CG or other next-generation sequencing platforms' data. By using a cloud-based infrastructure, we can also assure sufficient and on-demand computation and storage resources to handle the analysis tasks. The tools are freely available for use from an NBIC/CTMM-TraIT (The Netherlands Bioinformatics Center/Center for Translational Molecular Medicine) cloud-based Galaxy instance, or can be installed to a local (production) Galaxy via the NBIC Galaxy tool shed.

    Structural and functional variation in soil fungal communities associated with litter bags containing maize leaf

    Soil fungi are key players in the degradation of recalcitrant organic matter in terrestrial ecosystems. To examine the organisms and genes responsible for complex organic matter degradation in soil, we tracked changes in fungal community composition and expressed genes in soil adjacent to mesh bags containing maize leaves undergoing decomposition. Using high-throughput sequencing approaches, changes in fungal community composition were determined by targeting 18S rRNA gene sequences, whereas community gene expression was examined via a metatranscriptomic approach. The majority of the 93 000 partial 18S rRNA gene sequences generated were affiliated with the Ascomycota and Basidiomycota. Fungal diversity was at least 224 operational taxonomic units at the 97% similarity cutoff level. During litter degradation, the relative proportion of Basidiomycota increased, with a decrease in Ascomycota : Basidiomycota ratios over time. The most commonly detected decomposition-associated fungi included Agaricomycetes and Tremellales as well as unclassified Mucoromycotina. The majority of protein families found in the metatranscriptomic data were affiliated with fungal groups described to degrade plant-derived cellulose, such as Mucoraceae, Chaetomiaceae, Sordariaceae, Sebacinaceae, Tremellaceae, Psathyrellaceae and Schizophyllaceae. The combination of high-throughput rRNA gene-based and metatranscriptomic approaches provided perspectives into the organisms and genes involved in complex organic matter in soil.

    Can subtle changes in gene expression be consistently detected with different microarray platforms?

    Background: The comparability of gene expression data generated with different microarray platforms is still a matter of concern. Here we address the performance and the overlap in the detection of differentially expressed genes for five different microarray platforms in a challenging biological context where differences in gene expression are few and subtle. Results: Gene expression profiles in the hippocampus of five wild-type and five transgenic δC-doublecortin-like kinase mice were evaluated with five microarray platforms: Applied Biosystems, Affymetrix, Agilent, Illumina, and LGTC home-spotted arrays. Using a fixed false discovery rate of 10%, we detected surprising differences in the number of differentially expressed genes per platform. Four genes were selected by ABI, 130 by Affymetrix, 3,051 by Agilent, 54 by Illumina, and 13 by LGTC. Two genes were found significantly differentially expressed by all platforms, and the four genes identified by the ABI platform were found by at least three other platforms. Quantitative RT-PCR analysis confirmed 20 out of 28 of the genes detected by two or more platforms and 8 out of 15 of the genes detected by Agilent only. We observed better correlations between platforms when ranking the genes by significance level than when applying a fixed statistical cut-off. We demonstrate significant overlap in the affected gene sets identified by the different platforms, although biological processes were represented by only partially overlapping sets of genes. Aberrances in GABA-ergic signalling in the transgenic mice were consistently found by all platforms. Conclusion: The different microarray platforms give partially complementary views on the biological processes affected. Our data indicate that when analyzing samples with only subtle differences in gene expression, the use of two different platforms might be more attractive than increasing the number of replicates. Commercial two-color platforms seem to have higher power for finding differentially expressed genes between groups with small differences in expression.

    Pathogen-induced activation of disease-suppressive functions in the endophytic root microbiome

    Microorganisms living inside plants can promote plant growth and health, but their genomic and functional diversity remain largely elusive. Here, metagenomics and network inference show that fungal infection of plant roots enriched for Chitinophagaceae and Flavobacteriaceae in the root endosphere and for chitinase genes and various unknown biosynthetic gene clusters encoding the production of nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs). After strain-level genome reconstruction, a consortium of Chitinophaga and Flavobacterium was designed that consistently suppressed fungal root disease. Site-directed mutagenesis then revealed that a previously unidentified NRPS-PKS gene cluster from Flavobacterium was essential for disease suppression by the endophytic consortium. Our results highlight that endophytic root microbiomes harbor a wealth of as yet unknown functional traits that, in concert, can protect the plant inside out.

    Soil networks become more connected and take up more carbon as nature restoration progresses

    Soil organisms have an important role in aboveground community dynamics and ecosystem functioning in terrestrial ecosystems. However, most studies have considered soil biota as a black box or focussed on specific groups, whereas little is known about entire soil networks. Here we show that during the course of nature restoration on abandoned arable land a compositional shift in soil biota, preceded by tightening of the belowground networks, corresponds with enhanced efficiency of carbon uptake. In mid- and long-term abandoned field soil, carbon uptake by fungi increases without an increase in fungal biomass or shift in bacterial-to-fungal ratio. The implication of our findings is that during nature restoration the efficiency of nutrient cycling and carbon uptake can increase by a shift in fungal composition and/or fungal activity. Therefore, we propose that relationships between soil food web structure and carbon cycling in soils need to be reconsidered

    A Comparison of rpoB and 16S rRNA as Markers in Pyrosequencing Studies of Bacterial Diversity

    Background: The 16S rRNA gene is the gold standard in molecular surveys of bacterial and archaeal diversity, but it has the disadvantages that it is often multiple-copy, has little resolution below the species level and cannot be readily interpreted in an evolutionary framework. We compared the 16S rRNA marker with the single-copy, protein-coding rpoB marker by amplifying and sequencing both from a single soil sample. Because the higher genetic resolution of the rpoB gene prohibits its use as a universal marker, we employed consensus-degenerate primers targeting the Proteobacteria. Methodology/Principal Findings: Pyrosequencing can be problematic because of the poor resolution of homopolymer runs. As these erroneous runs disrupt the reading frame of protein-coding sequences, removal of sequences containing nonsense mutations was found to be a valuable filter in addition to flowgram-based denoising. Although both markers gave similar estimates of total diversity, the rpoB marker revealed more species, requiring an order of magnitude fewer reads to obtain 90% of the true diversity. The application of population genetic methods was demonstrated on a particularly abundant sequence cluster. Conclusions/Significance: The rpoB marker can be a complement to the 16S rRNA marker for high throughput microbial diversity studies focusing on specific taxonomic groups. Additional error filtering is possible and tests for recombination or selection can be employed.

    Cultivation-independent and cultivation-dependent metagenomes reveal genetic and enzymatic potential of microbial community involved in the degradation of a complex microbial polymer

    Background: Cultivation-independent methods, including metagenomics, are tools for the exploration and discovery of biotechnological compounds produced by microbes in natural environments. Glycoside hydrolase (GH) enzymes are in high demand in industry for the production of goods and biofuel and for the removal of problematic biofilms and exopolysaccharides (EPS). Biofilms and EPS are complex, requiring a wide range of enzymes for complete degradation. The aim of this study was to identify potential GH microbial producers and GH genes with biotechnological potential, using the EPS-complex structure (WH15EPS) of Acidobacteria Granulicella sp. strain WH15 as an enrichment factor, in cultivation-independent and cultivation-dependent methods. We performed stable isotope probing (SIP) combined with metagenomics on topsoil litter amended with WH15EPS and coupled solid culture-EPS amended medium with metagenomics. Results: SIP metagenome analysis of the soil litter demonstrated that the phyla Proteobacteria, Actinobacteria, Acidobacteria, and Planctomycetes were the most abundant in WH15EPS amended and unamended treatments. The enrichment cultures in solid culture medium coupled to metagenomics demonstrated an enrichment in Proteobacteria, and the metagenome assembly of these enrichment cultures resulted in 4 metagenome-assembled genomes (MAGs) of microbes with low identity (42–86%) to known microorganisms. Among all retrieved carbohydrate-active enzyme (CAZyme) genes, glycoside transferase (GT) was the most abundant family in both the culture-independent and the culture-based metagenome datasets. Within the glycoside hydrolases (GHs), GH13 was the most abundant family in both metagenome datasets. In the “heavy” fraction of the culture-independent metagenome SIP dataset, the GH109 (α-N-acetylgalactosaminidases), GH117 (agarases), GH50 (agarases), GH32 (invertases and inulinases), GH17 (endoglucanases), and GH71 (mutanases) families were more abundant in comparison with the controls. Those GH families are affiliated with microorganisms that are probably capable of degrading WH15EPS and are potentially applicable for biofilm deconstruction. Subsequently, in the culture-based metagenome, the 4 assembled MAGs (unclassified Proteobacteria) also contained GH families of interest, including mannosidases, lysozymes, galactosidases, and chitinases. Conclusions: We demonstrated that the functional diversity induced by the presence of WH15EPS in both culture-independent and culture-dependent approaches was enriched in GHs, such as amylases and endoglucanases, that could be applied in the chemical, pharmaceutical, and food industrial sectors. Furthermore, WH15EPS may be used for the investigation and isolation of yet unknown taxa, such as unclassified Proteobacteria and Planctomycetes, increasing the number of current cultured bacterial representatives with potential biotechnological traits.

    Video Byte: Forest floor microbes produce tough biofilm breaker: Exploring Solutions from Nature
